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Abstract 

We prove an almost sure invariance principle (approximation by d-dimensional 
Brownian motion) for vector- valued Holder observables of large classes of nonuniformly 
hyperbolic dynamical systems. These systems include Axiom A diffeomorphisms and 
flows as well as systems modelled by Young towers with moderate tail decay rates. 

In particular, the position variable of the planar periodic Lorentz gas with finite 
horizon approximates a 2-dimensional Brownian motion. 

1 Introduction 

The scalar almost sure invariance principle (ASIP), or approximation by one-dimensional 
Brownian motion, is a strong statistical property of sequences of random variables introduced 
by Strassen |^ HT] . It implies numerous other statistical limit laws including the central 
limit theorem, the functional central limit theorem, and the law of the iterated logarithm. 
See IHH] and references therein for a survey of consequences of the ASIP. 

The scalar ASIP has been shown to hold for large classes of dynamical systems IT7| 
eg ini 1211 123 EHl ED ESI- Chernov & Dolgopyat fTUl, Problem 1] asked for a proof of the 
ASIP for R'^-valued observables, and it is this problem that is solved in this paper. Our main 
result applies to a large variety of dynamical systems, as surveyed in Section HJ 

As a secondary matter, we obtain explicit error estimates that depend on the dimension 
d and the lack of hyperbolicity. Even for = 1, this estimate is better than those in almost 
all of the above references. The exception is |i21j| which gives the best available estimate for 
scalar ASIPs for a restricted class of systems. 
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1.1 Statement of the main results 

Definition 1.1 A sequence {Sn} of random variables with values in M°' satisfies a d- 
dimensional almost sure invariance principle (ASIP) if there exists A > and a probability 
space supporting a sequence of random variables S'^ and a li- dimensional Brownian motion 
W{t) such that 

(a) {SN;N>l}=,{S%;N>l},and 

(b) S^f = W{N) + 0{N^~^) as iV ^ oo almost everywhere. 

For brevity, we write = W{N) + 0{N^~^) a.e. The ASIP for a one-parameter family 5*^ 
of M'^-valued random variables is defined similarly, and denoted St = W{T) + 0{T"^) a.e. 

Remark 1.2 The ASIP is said to be nondegenerate if the Brownian motion W{t) has non- 
singular covariance matrix E. For the classes of dynamical systems considered in this paper, 
the ASIP is nondegenerate for typical observables. More precisely, there is a closed subspace 
Z of infinite codimension in the space of all (piecewise) Holder M'^-valued observables such 
that S is nonsingular whenever (p ^ Z. (By considering all one-dimensional projections it 
suffices to consider the case d = 1. This is done explicitly in for example [23 Section 4.3].) 



Axiom A diffeomorphisms and flows Our results are most easily stated in the uni- 
formly hyperbolic (Axiom A) context. 

Theorem 1.3 Let f : M ^ M be a diffeomorphism with a (nontrivial) uniformly hyperbolic 
basic set X C M, and suppose that /x is an equilibrium measure corresponding to a Holder 
potential. Let (p : X ^ be a mean zero Holder observable with partial sums = 
'^n=i 'P ° ■ Then for any e > 0, 

Sj^ = W{N) + 0{N^+') a.e. where (3 = f^. 

(The improved estimate P = \ holds when d = 1 121].) 

An immediate consequence (see for example fTj is the corresponding result for 
Axiom A flows. 

Corollary 1.4 Let ft'.M^Mbea smooth flow with a (nontrivial) uniformly hyperbolic 
basic set X C M , and suppose that /i is an equilibrium measure corresponding to a Holder 
potential. Let (p : X ^ be a mean zero Holder observable with partial sums Sx — 
/o 4' ° ft dt. Then for any e > 0, 

St = W{T) + 0{T^+') a.e. where (3 = fgf . 

(The improved estimate P = \ holds when d = 1 WJ[ U^.) I 
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Remark 1.5 Denker & Philipp ^7] proved Theorem ll.3l and Corollary II .41 in the case d = 1 
(though with a weaker error term). 

Nonuniformly hyperbolic systems Our results apply also to maps f : M —>■ M that 
are nonuniformly expanding/hyperbolic in the sense of Young |^ |^. Roughly speaking, 
such maps possess a subset A C M and a return time : A — > Z"*" such that the induced 
map : A — > A is uniformly hyperbolic. 

Theorem 1.6 Let f : M ^ M be a diffeomorphism (possibly with singularities) that is 
nonuniformly hyperbolic in the sense of Young \44\ \J^ - In particular, f satisfies condi- 
tions (A1)-(A4) in Section \4.^ and possesses an SRB measure m. Assume that the return 
time function R lies in U' , p > 2. Let (p : M ^ be a mean zero Holder observation with 
partial sums Sjy = "^^=1 'P ° ■ Then for any e > 0, 

= W{N) + 0{N'^+') a.e. where (3 = 

If d = 1 then (3 can be improved to P = ^ + ^ for 2 < p < 4 and = | for p > 4. 

Again, there is an immediate extension to nonuniformly hyperbolic flows. Suppose that 
f : M —>■ M satisfies the assumptions of Theorem 11.61 with R G L^, p > 2, and that ft is a 
suspension flow over / with a (uniformly bounded) Holder roof function. By [S2] , M'^- valued 
Holder observables of the suspension flow satisfy an ASIP of the form 5*^ = W{T) + 0(T^"'"'') 
a.e. where f3 is as in Theorem 11.61 

Application to Lorentz gases The planar periodic Lorentz gas was introduced by 
Sinai (39|. This is a three-dimensional flow with phase space (M^ — Q)x S^, where C is 
a periodic array of disjoint convex regions with boundaries. The coordinates are position 
g G — r2 and velocity f G 5*^. The flow satisfies the finite horizon condition if the time 
between collisions with dQ is uniformly bounded. 

Let q(t) G denote the position at time t of a particle starting at position g(0) pointing 
in direction f(0). Bunimovich & Sinai P, see also 0, proved that q{t) satisfies a two- 
dimensional functional central limit theorem (weak invariance principle) supporting the view 
of such flows as a deterministic model for Brownian motion. We complete this circle of ideas 
by proving the strong version of this result. 

Theorem 1.7 Consider a planar periodic Lorentz gas satisfying the finite horizon condition. 
Let e > 0. There is a two-dimensional Brownian motion W{t) with nonsingular covariance 
matrix such that for almost every initial condition, q{T) = W{T) + 0(Tt5+'^). 

Remark 1.8 (a) A number of authors jHl EOl ISH] have independently established scalar 
ASIPs for one-dimensional projections of q{t). In hindsight, the scalar ASIP for the Lorentz 
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gas follows from earlier work of [21], again with (5 = \. (We note that the methods of [21] 
apply in the first place only to the time-reversal of the dynamical system under study. Their 
applicability here is due to the fact that the class of systems is closed under time-reversal.) 

(b) The finite horizon condition is crucial. For infinite horizons, Szasz & Varjii 02] prove 
that q{t) lies in the nonstandard domain of the normal distribution. In particular, the central 
limit theorem fails, hence the ASIP fails. 

1.2 Consequences of the vector-valued ASIP 

For convenience, we suppose that the Brownian motion in the ASIP is nondegenerate. Co- 
ordinates can be chosen on so that W{t) is a standard d-dimensional Brownian motion 
with S = Id. Throughout, the norm on is taken to be the usual Euclidean norm. The 
following consequences of the ASIP are summarised in [SJ P- 233]. Here, LIL stands for law 
of the iterated logarithm and the functional LIL stated below is a far-reaching generalisation, 
due to Strassen, of the classical LIL. 

Proposition 1.9 For the dynamical systems to which the results in this paper apply, the 
following consequences hold (after normalisation so that S = Id): 

• Functional LIL Let C = C([0, 1], M'^) he the Banach space of continuous maps f : [0, 1] — > 

with the supremum norm. Let K he the (compact) set of f & C ahsolutely continuous with 
/(O) = 0, ^Q\f'{t)\'^dt < 1. Define fn{i/n) = Si/ y/2n log logn, i = 0, . . . , and linearly 
interpolate to ohtain fn G C. Then a.s. the sequence {fn} is relatively compact in C and its 
set of limit points is precisely K. 

• Upper and lower class refinement of the LIL Let (f)[t) : M. ^ he a positive 
nondecreasing function. Then 

P{\Sn\ > N^{N) z.o.) = or 1 

according to whether exp(— du converges or diverges. 

• Upper and lower class refinement of Chung's LIL Let (f){t) : M ^ M 6e a positive 
nondecreasing function. Then there is a constant c ( depending only on d) such that 

P(max|S'„| < cN^-\N) i.o.) = or 1 

n<N 

according to whether exp(— </)^(m)) du converges or diverges. 

• Central limit theorem and functional central limit theorem ■ 

Remark 1.10 (a) Berger 4j gives a unified approach to the ASIP for weakly dependent 
sequences of random variables with values in a real separable Banach space, but with error 
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term o(\/iVTogToglV). It follows from Berger jU Corollary 4.1, part A. 5] and Melbourne 
& Nicol jnO] that the Banach space-valued ASIP formulated in [4, Theorem 3.2] holds for 
all dynamical systems considered in this paper. In particular, the M'^-valued ASIP holds 
with error term o(v^iVTogloglV). This error term suffices for the functional LIL, but is 
inadequate for the upper and lower class refinements and for the (functional) central limit 
theorem; whereas the error term established in this paper suffices. Indeed this was the 
original motivation of Jain et al. [221 to improve the error term in Strassen's scalar ASIP. 

(b) The M'^- valued functional central limit theorem, being a distributional result, can be 
proved directly under the more general condition i? e in Theorem 11.61 reduce as in this 
paper to the setting in Section |H1 and then apply the method of Section 3.3]. 

We end this section by discussing briefly the probabilistic methods used in this paper. 
Strassen's original proof of the scalar ASIP for IIDs and martingales 1^ relies heavily 
on the Skorokhod embedding theorem for scalar stochastic processes. This method was 
extended to weakly dependent sequences of random variables by a number of authors, using 
blocking arguments to reduce to the martingale case, see [38 • In particular, Philipp & 
Stout [SHI Theorem 7.1] formulated a version of the scalar ASIP which is particularly useful 
for dynamical systems [211 El 1^0] • 

Attempts to extend Strassen's proof to the M'^-valued situation were only partially suc- 
cessful [2Z1 , and the same is true for the completely different quantile transform method of 
Csorgo & Revesz Eventually, Berkes & Philipp 5J introduced a third method which 
works in any number of dimensions, and the applicability of this method was extended to 
weakly dependent sequences by Kuelbs & Philipp [28 . 

The remainder of the paper is organised as follows. In Section |2l we combine the blocking 
argument in |3H] with the results of |H1 I2H] to prove a vector-valued ASIP for R'^-valued 
random variables satisfying certain hypotheses. In Section |2l we verify these hypotheses 
for Gibbs-Markov maps and derive Theorem 11.31 as a consequence. In Section HJ we flrst 
prove the ASIP for nonuniformly expanding maps and then prove Theorems 11.61 and II . 71 We 
also list numerous other situations to which our results apply, and we mention some open 
problems regarding time-one maps of flows. 

2 A vector-valued ASIP for functions of mixing se- 
quences 

In this section, we prove a vector-valued ASIP for M'^-valued random variables satisfying 
certain hypotheses. This is the vector- valued analogue of fSS', Theorem 7.1] though with 
hypotheses tailored to the dynamical systems setting. (A result of this type is hinted at 
in Kuelbs & Philipp [2Hl! but it is necessary to work through the details to determine the 
hypotheses, which were left unstated. In any case, the estimates in ()2.3|) and ()2.5p are not 
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so natural in the probabilistic setting in [2H|, and partly account for our strong error term.) 



2.1 Statement of the ASIP 

Let ^1, • • • be a sequence of real-valued random variables and let JFJ = a < n < b}. 
We assume the strong-mixing condition 

\P{AB) - P{A)P{B)\ < Ct" for all A G J'f' and B e J^^^^. (2.1) 

Let p > 2, and let rj^ be a strictly stationary sequence of jF^-measurable M'^- valued 
random variables satisfying 

Er]^ = and \Vn\p = C, (2.2) 
and the (backwards) Burkholder-type inequality 

^^^\En-i^n\ <CN"^. (2.3) 

Define rjin = E{rin\J-'n~^^). We require that 

\Vn-Ven\p<CT^. (2.4) 

Let S be a symmetric positive semidefinite dx d covariance matrix. Given u G M'^, define 
fwiu) = E exp{i{u,J2n<NVn/y/N)) and g{u) = exp(— Su)). Assume that there exists 
e > such that 

\fN{u) - g{u)\ < CN-'^ for all \u\ < em. (2.5) 

— +2c/H~3 r \ 

Theorem 2.1 Assume conditions flTTll-flTKl). Let (3 = \^_^^ G f^,|), and let e > 0. 

Then there is a d- dimensional Brownian motion W{t) with covariance matrix S such that 
En<NVn = W{N) + 0{NP^^) a.e. 

Remark 2.2 (a) For c? = 1, we obtain under similar hypotheses, but using a different 
method, the improved error estimate /3=^ + ifor2<p<4 and /3 = | for p > 4. See 
Appendix |X1 

(b) It is evident from the proof of Theorem 12 . II that the exponential rates in 1)2.111 and ()2.4j) 
can be replaced by sufficiently high polynomial rates. Further relaxing of the assumptions 
is possible at the cost of obtaining a weaker estimate in Theorem 12.11 
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2.2 Preliminaries 

The following result of [ISIESl is stated as jHHl Lemma 7.2.1] 

Lemma 2.3 Let J^,g be a-fields and 13 > Q such that \P{AB) - P{A)P{B)\ < (3 for all 
A e J^, B e g. Letp,q,r > 1 satisfy 1 + i + i = 1. Suppose that ^ G L''{T), r] G L%g). 

Then \E{^r]) - E{OE{v)\ < lOp-^UWrMs- 

The next result is due to Dvoretsky |Tn], see [23 Lemma 2.2]. 

Lemma 2.4 Let JF, g be a-fields. Let C, be a complex-valued J-" -measurable random variable 
with 1^1 < 1. Then E\E{^\g) - E^\ < 27r sup^^^^^gg \PiAB) - P{A)P{B)\. 

The following Gal-Koksma strong law ^22j is stated in |H8| Theorem Al]. 

Lemma 2.5 Let be a sequence of random variables with E^j = 0, and let q > 0. Suppose 
that E\ EJ=m^jP < n'i - ml for all n > m > 1. For any e > 0, Ejli^i < ^2+' a.e. 

2.3 Introduction of the blocks 

Fix Q > a > 0. Define random variables yi, Zi,y2, Z2 . . . consisting of sums of consecutive 
Viin),n where the j'th blocks yj and Zj consist of [j*^] and [j"] such terms respectively, and 
throughout the j'th blocks i{n) = [^j"]. 

In other words, yj = J2n Ve,n, where i = [|j°] and the sum ranges over J2i=ii['^'^] + [^°]) < 
n < Eti^([^^] + + [j% Similarly for Zj. 

Let = a{yj; a < j < b} and C"^ = a{zj; a<j< b}. 

Lemma 2.6 There exists (a modified) r G (0, 1) such that for all k,n > 1, 

\PiAB) - P{A)P{B) \ < r('^+")'' for all A e C\ and B & 
The same is true for all A ^ C\ and B G C^j^^- 

Proof Note that C\ is defined using yi, . . . ,yk which are defined using rjin with i < [|fc"], 

^ — Yli=ii['''^] + [^"]) + [k'^]- This involves conditioning on with n < YliZiil'^^] + [^"]) + 
[k^] + Ijk"']. Similarly for and we obtain 

Hence |P(A5) - P{A)P{B)\ < where = Ej=fcT/([^^] + [^D + [k"] - [i'^l + 1- For all 
A;, n > 1, we compute that ^ (A; + n)° as required for the first statement. (Note that the 
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details for the cases n = 1 and n > 2 are slightly different.) The second statement is proved 
in the same way. I 

For > 1, let i/Mj^, zmj^ be the pair of blocks that contains r]ii^N),N- Write 

In particular, Pm^-i <N< Pm^, and Pa,^ -Pa/^-i = [M^] + [M^] ~ M^. It is immediate 
that 

Proposition 2.7 Writing M = Mn, we have M^^^ ~ Ylj<M3^ ~ particular, 
Pm-Pa/-i ~ Ar«/(i+«). " I 

Proposition 2.8 Y.n>i l'7n - '7fe|p < oo- 

Proof Focusing on the M'th block, and applying ()2.4j) . we obtain 'Yl,PM-x<n<PM 1^" ~ 
Vi{n),n\p M*3r5*^° which is summable. ■ 

Proposition 2.9 \yj\p ^ j^'^ and \zj\p <^j^°'. 

Proof Write yj = ^* rjin where J2* = J2'n=aj+i- By Proposition l2.81 ()2.3p and stationarity, 
IVjlp < I T.*iVen - Vn)\p + I '7n|p < 1 + J^'^ < j^Q. Similarly for zj. I 



2.4 Approximation result 

Theorem 2.10 Let [3 = max{i + ^Q, |}/(1 + Q). For any e > 0, there exists a > such 
that En<iv^n - ^j<A% Vj < 
Begin by writing 

Pm^ 

n<N j<MN n<PM^ j<Mi^ n=N+l j<MM 

In the next three lemmas, we estimate these three terms (following [HHl Lemmas 7.3.2, 7.3.3, 
7.3.4]). The result follows by combining these estimates. 

Lemma 2.11 En<PM^ Vn - J^jKM^iVj + ^j) < 1 

Proof By Proposition EH En<oo \Vn-Ven\ < oo a.e. Hence | En<PM ^"~^i<M(2/i + ^i)l = 

I En<PM(^" - ^te)| < En<oo \Vn " Ven\ < 1 a.e. ■ 
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Lemma 2.12 Let /? = (| + + Q)- For any e > 0, Ej<M^ < 

Proof We have \zj\p <^ and so Ylm-^^] Sm^"" n^+" — m^~^°'. By Lemmas EI 
and 12.61 (with f = r*^ where e = 1 — 2/p), for all i < j, 

\EziZj\ <^ |2;j|p|zj|pf-'" < (i^f'^j^f-^'")^ 

which is summable over (i, j) G N^. We have shown that E{YJ'.^^Zjf < 71^+" - 7/1^+", for 
all 1 < m < n. By Lemma X]j<M -^j ^ ^-^^ for 7 > ^(1 + a), and the result follows 
from Proposition 12.71 I 

Lemma 2.13 Let = + \Q)/{1 + Q)- For any e > 0, Yfn=N+i Vn < N^"-' a.e. 

Proof Let Am = ^s.xp^,j_^+i<n<Pm \Yln=N+iVn\- By (j2ISI) and stationarity, |Aa/|p < 
(Pa/ - Pm-i)^ < M^Q. Hence 

which is summable provided 7 > - + \Q. By Borel-Cantelli, <^ a.e. and the result 

follows from Proposition 12.71 I 



2.5 Proof of Theorem EH] 

We follow the argument of Kuelbs & Philipp |2H1 which extends Berkes & Philipp 0. Let 

Xj = [j'^]~^yj- Note that C\ is an increasing sequence of a-fields such that Xj is C{- 
measurable. 

Proposition 2.14 Let 7 G (0, T/iere exists e > sfic/i i/iai E|£'(exp(i(M, Xj))|£{"^) — 
exp(— < C'p^^^ for all m G M'' satisfying \u\ < ep . 

Proof Let fNiu) = E exp{i{u,^^^jyrin/VN)), g{u) = exp{—^{u,Tiu)), and write 

E{exp(^(M,X,))|£r^}-(7(ix)= (i5;{exp(2(M,X,))|£r'}-^exp(z(M,X,))) 

+ (^Eexp{t{u, Ifrhj)) -Eexp{t{u, [fr'^J2n<[jQ]Vn))) + {f[M^) " 9iu)) 
= 1 + 11 + III. 
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Using Lemmas 12.41 and 12.61 E\I\ <^ t^" . Also, /// is estimated by (j2.5p so it remains to 
estimate //. Write i/j = J2* Vin where J2* = J2n=a''+i- By stationarity and Proposition] 



|//| = \E{expiz{u, [fr'^E^n)) - exp{z{u, [j«]-^EV)))l 

< erij"^]-"^ I En>liVin -Vn)\l « T'^'^ 



as required. 



Proposition 2.15 Let G be the distribution function o/A^(0, S). Then G{u : \u\ > T} < 

Proof This is a straightforward calculation, see for example [SI p. 43]. I 

Let Xj = C'p~^^, Tj = ep , where 7 G (0, ^Q) is chosen below. By Propositions 12.14) 
and 12.151 we have 

E\E{eicp{i{u,Xj))\d^'^} - g{u)\ < Xj for all \u\ < Tj, 
G{u : \u\ > \Tj} < 6j, 

where 5j = e~^'^'^ . These are the hypotheses of Theorem 1]. Defining 

= WdTf^ logTj + 4a|t/ + 6,, 
as in we have a,- <^ j~'^logj + j('^+|)T^3'3j which is summable provided 1 < 7 < *^ 1^ . 

£1+2 

We take 7 slightly larger than 1 and Q slightly larger than 4(i + 6 so that aj ^ J^*-^^^-*- 

Applying [31 Theorem 1], we conclude that (passing to a richer probability space) there 
is a sequence of i.i.d. random variables Yj with distribution iV(0, S) such that 

\Xj -Yj\ <^j-'^^+'^ a.e. 

Let W{t) be a Brownian motion with covariance S and define Y* = [jQ]-2{W{hj)-W{hj-i)) 
where hj = Yln=i[''^^] ~ j^^^- Then {Yj} =rf {Y*} and without loss (after passing to a richer 
probability space), Yj = Y*. We have 

E,<M%- = E,<Mb'^]^^. = J:,<m[j'']Hx, - Y,) + E,<M w{h,) - w{h,.,) 
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Now 

E,<Mb^]^(^. - y^) « E,<M « E,<M J^^-^ « « ivfo/d+o). 

If /iM > N, then hu-N < Pm-Pm-i < iVf^, whereas iihu < N then N-hu < PM-hu = 
EjXAfb'"] < Mi+". Hence /im - iV < ^max{Q,i+a}/(i+Q)_ Taking a small, we obtain 
W{hM) = W{N) + 0(Armax{|Q i}/(i+Q)+e^^ Combining these estimates with Theorem CTH 
we obtain 

Taking Q slightly larger than Ad + 6 yields the required result. I 



3 ASIP for Gibbs- Markov maps 

In this section we prove the ASIP for weighted Lipschitz R'^-valued observables of Gibbs- 
Markov maps. Roughly speaking, these are uniformly expanding maps with countably many 
inverse branches and good distortion properties, and have been studied extensively in |jy. 
We derive Theorem 11.31 as a consequence. 



3.1 Gibbs-Markov maps 

Let (A,m) be a Lebesgue space with a countable measurable partition a. Without loss, we 
suppose that all partition elements a G a have m(a) > 0. Recall that a measure-preserving 
transformation F : A — A is a Markov map if Fa is a union of elements of a and -F|a is 
injective for all a G a. Define a' to be the coarsest partition of A such that Fa is a union of 
atoms in a' for all a G a. (So a' is a coarser partition than a.) If oq, . . . , a„_i G a, we define 
the 77,-cylinder [oq, . . . , a„_i] = r\"=Q F~^ai. It is assumed that F and a separate points in A 
(if X, ?/ G A and x y, then for n large enough there exist distinct n-cylinders that contain 
X and y). 

Let < P < 1. We define a metric dp on A by dp{x,y) = where s{x,y) is the 

greatest integer n > such that x,y lie in the same rz-cylinder. Define g = JF~^ = ^(^!^p-^ 
and set Qk = g g o F ■ ■ ■ g o F^^^. 

A Markov map F is topologically mixing if for all a, 6 G a there exists > 1 such that 
F"a n 6 7^ for all > A^. A Markov map F is Gibbs-Markov if 

(i) Big images property: There exists c > such that m{Fa) > c for all a G a. 

(ii) Distortion: logf^la is Lipschitz with respect to dp for all a G a'. 

Let cto"^ denote the partition of A into length k cylinders a = [oq, . . . , Ofc-i]. The following 
result of [2] is stated explicitly in [201 Lemma 2.4(b)]. 
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Lemma 3.1 Let F be a topologically mixing Gibbs-Markov map. Then ^m{anF — 
m(a)m(6)| < Cr^ m{a)m{hY^'^ for all a E OiQ~^ and all measurable b. I 

3.2 Weighted Lipschitz observations 

Let p G [1,00). We fix a sequence of weights R{a) > satisfying \R\p = 
(^^g^ m(a)i?(a)P)-'^/p < 00. Given $ : A ^ M continuous, define |$|/3 to be the Lip- 
schitz constant of $ with respect to the metric dj3. Let ||$||oo = sup^g^ |$la|oo/-R(a), 
||$||/3 = sup^gQ, |$la|^/i?(a). Let B consist of the space of weighted Lipschitz functions 
with ||$|| = ||$||oo + \\^\\f3 < 00. Note in particular that R E B and = 1. We have the 
embeddings Lip <Z B <Z <Z , where Lip is the space of (globally) Lipschitz functions. 

Lemma 3.2 Let e B with Xv$ = 0. Then \^ - E{^al~^)\p < \\^\i3\R\pl3'' for all k >l. 

Proof (cf. IHOl Lemma 2.4(a)]) Note that E(<l>|«^^^) is constant on partition elements 
a G al~^ with value ^ la^dm, and that |$1„ - f^^dm\^ < diam^(a) < 

\mpR{a)(3K Hence, |$ - E(<l>|«o^-i)|^ < (||$||^/3^)^ Eae«(;- W^(a) = (||$||/3/3l ■ 

3.3 A maximal inequality 

Given a measure-preserving transformation F : A ^ A of a probability space (A,m), the 
transfer (Perron-Frobenius) operator L : ^ is defined by L$ dm = $ o F dm 
for all $ G L^, \E' G This restricts to an operator on L'^, 1 < p < 00. 

Lemma 3.3 Lei $ G ^^(A), 1 < p < 00 with L<^ = 0. Then \ maxo<^<jv-i |E!L£ *°^"| \p < 

Proof Note that L = E{-\F~^Ai) where Ai is the underlying a-algebra. By hypothesis 
the sequence {$ o F^; n > 0} is a reverse martingale difference sequence. Passing to the 
natural extension we obtain an martingale difference sequence {wn] n G Z} such that 
$ o = W-n- By Burkholder's inequality we have | maxi<fc<Ar | ^^^q""^"! Ip — CN^ . 
Setting i = N — k and using stationarity, 

maxo<£<Ar_i I J2e ^ o F"-\ =d maxo<^<Ar_i | YZn^^^ o F"| = maxi<fc<Ar | Xlo^nl, 

proving the result. I 

^This follows from 5', eqns (1.4) and (3.3)] and is stated explicitly in |37l Eq. 1]. 
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3.4 Quasicompactness and the central limit theorem 

Let F : A ^ A be a topologically mixing Gibbs-Markov map with transfer operator L : L} ^ 
L^. It is well-known ^ EO] that L restricts to a bounded operator on weighted Lipschitz 
observables $ e ;B and L{B) C Lip. Moreover L : i3 — > i3 is quasicompact LI = 1 and the 
spectral radius of L restricted to Bq = & B : $ dm = 0} is strictly less than 1. 

We define B"^ to the be the space of R*^ weighted Lipschitz observables, so $ = 
(<l>i,...,$rf) e B'^ if and only if $i G B ioi i = l,...,c?. Similarly, we define Bq. We 
suppress the superscript for spaces such as and Lip relying on the context. 

Proposition 3.4 Suppose that $ G Bq. Then there exists \E' G -Bg and x ^ such that 
^ = '^ + X°F-x and L^' = 0. 

Proof (cf. [SIT, Proof of Corollary 2.3(c)]) Define x = YlT=i L^^- This converges in Bq since 
the spectral radius of L is less than 1. Since L{B'^) C Lip, we have x ^ By construction, 
= 0. I 

Suppose that p>2. Let $ G i^Q C and assume that L$ = 0. Let Sm = ^n<N ^ ° 
and form the d x d matrix S^Sj^. We define the covariance matrix = jj: Jj^S^Sj^ dm = 
dm. 

Lemma 3.5 There exists e > such that 

J^exp{i{u, Sn)N-^) dm - exp{-l{u,^u)) = 0{N-^) 
uniformly for -u G M"^ satisfying \u\ < eN^ . 

Proof We follow a standard argument establishing the central limit theorem with error 
term for systems with quasicompact transfer operator (see pHl Theorem 4.13] and references 
therein). Let S'^^^ denote the unit sphere in W^. Given u G M.'^, write u = tv where t > and 
V G S'^~^. Define the twisted transfer operator : B ^ B hj L^^ = L(e*^"'*^\E'). Recall 
that 1 is an isolated eigenvalue for L. For u small, the spectral radius of L„ is expP(M) 
where P is analytic, -P(O) = 0. Moreover jSHl P- 66] 

P{u) = -l{u,Eu) - iP3{v)t^ + Pi{v,t)t\ 

where P2{v) G M, Pi{y,t) G C are analytic, and there exists e > such that [SHI P- 67] 

&cp{NP{uN-^)) - exp(-|(M, Sm))(1 - iP^{v)t^N'^ = 0{N-^) 

uniformly for |m| < eN^. 
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Now J^exp(i{u, S]\r)N 2) dm = Jj^{L dm, and since the leading eigenvalue of Lu 

is isolated there exists 7 G (0, 1) such that 

J^exp{i{u, SN)N-^)dm- exp{NP{uN-^2)) < 7^, 
uniformly for \u\ < eN^. This completes the proof. I 

3.5 Statement and proof of ASIP for Gibbs-Markov maps 

Theorem 3.6 Suppose that F : A ^ A is a topologically mixing Gibbs-Markov m,ap. Define 
the Banach space B'^ corresponding to weights R & where p > 2. Suppose that $ : A ^ M'^ 
is a mean zero observable in B'^ with partial sums Sn = Yln=i ^ ° FK Then the conclusion 
of Theorem \l.b\ is valid. 

Proof By Proposition 13.41 Sn = Yln=i ° + 0(1) a.e. where m! = 0. Hence without 
loss we may suppose from the outset that L$ = 0. 

Define = $ o and ^„ = a„. Then //„ = ^n+i, • • •)• We verify the hypotheses of 
Theorem 12.11 

Hypothesis ()2.1|) follows from Lemma 13.11 and ()2.2|) is immediate. The remaining hy- 
potheses follow from Lemmas 13.31 13.21 and 13.51 respectively. 

This completes the proof for d > 2. The improved estimate for d = 1 follows from 
Theorem 1A.91 (One hypothesis is different, but it was verified in jHO]-) ' 

Proof of Theorem 11.31 This reduces by standard techniques to a two-sided and then 
one-sided subshift of finite type. The latter is a special case of a Gibbs-Markov map with 
finite alphabet, hence R G L°°. Theorem 11.31 follows from Theorem 13.61 with p = 00. ■ 

4 Applications to nonuniformly hyperbolic systems 

In this section, we prove the vector- valued ASIP for large classes of nonuniformly hyperbolic 
systems. In Subsection 14. 11 we consider nonuniformly expanding systems. In Subsection 14.21 
we consider nonuniformly hyperbolic systems, proving Theorems 11.61 and 11.71 Some open 
problems are described in Subsection 14.31 

4.1 Nonuniformly expanding systems 

Let (M, d) be a locally compact separable bounded metric space with Borel probability 
measure t] and let / : M ^ M be a nonsingular transformation for which rj is ergodic. 
Let A C M be a measurable subset with //(A) > 0. We suppose that there is an at most 
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countable measurable partition {A^} with f]{Aj) > 0, and that there exist integers Rj > 1, 
and constants A > 1; C > and 7 G (0, 1) such that for all j, 

(1) f^^ : Aj — s> A is a (measure-theoretic) bijection. 

(2) d{f^^x, f^^y) > \d{x,y) for all x,y E Aj. 

(3) d{fx, fy) < Cdif^^x, f^^y) for all x,y e Aj, k < Rj. 

(4) gj = "* — - satisfies | log gj{x) — log gj{y) \ < Cd(x,y)"' for almost all x,y E A. 

(5) ^.^(A,) < 00. 

A dynamical system / satisfying (l)-(5) is called nonuniformly expanding. 

Define the return time function R : A —>■ Z+ by R\j>^^ = Rj and the induced map F : A A 
by Fy = f^^^\y). It is well-known that there is a unique invariant probability measure m 
on M that is equivalent to rj. 

Theorem 4.1 Let f : M ^ M be a nonuniformly expanding map satisfying (l)-(5) above. 
Assume moreover that R G U'{A), p > 2. Let (p : M ^ M.^ be a mean zero Holder observation 
with partial sums Sjy = Yln=i ° /'' ■ Then the conclusion of Theorem \l.(A is valid. 

Proof This is identical to the proof of [^01 Theorem 2.9] so we just sketch the main steps. 
The induced map F : A ^ A is a topologically mixing Gibbs-Markov map with respect to 
the partition a = {Aj}. The induced observable $ : A ^ M"' given by $(?/) = XlS''"^ 4'{f''y) 
is weighted Lipschitz and satisfies the ASIP by Theorem IH.fil 

If F : A — i> A were the first return map, then the result would follow immediately 
from 1^21 Theorem 4.2] (see also [201 Theorem B.l]). The general result is proved by passing 
to a Young tower 03] which is a Markov extension of / : M — ^ M for which F is the first 
return map. ■ 

Remark 4.2 (a) The regularity assumption on (j) in Theorem 14.11 can be replaced by the 
more general assumption that the induced observable $ is weighted Lipschitz (with respect 
to the metric defined on the Gibbs-Markov system A). 

(b) A similar result holds for nonuniformly expanding semifiows Corollary 2.12]. 

Naturally, Theorem 14.11 includes uniformly expanding and piecewise expanding maps 
where the partition is finite (with p = 00). Further examples of nonuniformly expand- 
ing maps to which Theorem 14.11 applies include Alves-Viana maps, Liverani-Saussol-Vaienti 
(Pomeau-Manneville maps), multimodal maps, and circle maps with a neutral fixed point, 
see jSUl Section 4]. 
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4.2 Nonuniformly hyperbolic systems 

As was the case in jSU], the resuhs in this paper apply to dynamical systems that are 
nonuniformly hyperbolic in the sense of Young \44]l with return time function R G L^, p > 2. 

Let / : M ^ M be a diffeomorphism (possibly with singularities) defined on a Rie- 
mannian manifold {M,d). We assume from the start that / preserves a "nice" probability 
measure m (one of the conclusions in Young is that m is a SRB measure). 

Fix a subset A C M and a family of subsets of M called "stable disks" {VT**} that are 
disjoint and cover A. The stable disk containing x is labelled W{x). 

(Al) There is a partition {Aj} of A and integers Rj > 1 such that f^^{W^{x)) C W'^{f^^x) 
for all X G Aj. 

Define the return time function i? : A — > Z+ by R\j>^^ = Rj and the induced map 

F : A — > A by F{x) = f^^^\x). Form the discrete suspension map / : A — A where 
/(x, i) = {x,i + 1) for i < R{x) — 1 and f{x, R{x) — 1) = {Fx, 0). Define a separation time 
s : A X A ^ N by defining s{x,x') to be the greatest integer n > such that F^x,F^x' 
lie in the same partition element of A for = 0, . . . , n. (If x, x' do not lie in the same 
partition element, then we take s{x,x') = 0.) For general points p = {x,i),p' = {x',i') G A, 
define s{p,q) = s{x,x') if i = i' and s{p,q) = otherwise. This defines a separation time 
s : A X A — i> N. The projection vr : A ^ M, 7i{x, £) = f^x, satisfies vr/ = fir. 

(A2) There is a distinguished "unstable leaf" C A such that each stable disk intersects 
in precisely one point, and there exist constants C > 1, a G (0, 1) such that 

(i) rf(/"x, /"y) < Ca", for all y G W'{x), all n>0, and 

(ii) c/(/"x, /"?/) < Ca'^'^'y^ for all x,y and a\\ < n < R. 

Let A = A/ ~ where x ~ x' if x G W''^{x') and define the partition {Aj} of A. We 
obtain a well-defined return time function i? : A — > Z+ and induced map F : A ^ A. Let 
/ : A ^ A denote the quotient of / : A — A where (x, £) is identified with {x',i') if £ = i' 
and x' G W''^{x). Let vf : A ^ A denote the natural projection. The separation time on A 
drops down to a separation time on A. 

(A3) The map F : A — * A and partition {Aj} separate points in A. (It follows that dg{p, q) = 
Q^{p^i) defines a metric on A for each 6 G (0, 1).) 

(A4) There exist invariant probability measures m on A and m on A such that 

(i) TT : A ^ M and 7f : A ^ A are measure-preserving; and 

(ii) / : A — i> A is nonuniformly expanding in the sense of Subsection 14. II with induced 
map F : A — > A. (Conditions (2) and (3) are automatic.) 
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Proof of Theorem 11.61 This reduces, as in the proof of [201 Theorem 3.4], to the ASIP 
for the nonuniformly expanding map / : A — A and hence follows from Theorem 14.11 I 

Remark 4.3 Again, the regularity assumption on cf) can be relaxed, and the result extends 
to nonuniformly hyperbolic flows. 

Large classes of billiard maps and Lorentz flows, surveyed in pj,j satisfy the vector- 
valued ASIP. These include dispersing billiards (with flnite or inflnite horizons) and the 
corresponding Lorentz flows (assuming flnite horizons). 

Proof of Theorem 11.71 By periodicity, we can consider the quotient flow on the compact 
manifold M = (T^ — Q) x S^. The Poincare map f : X X on the compact cross- 
section X = X (^, |) is called the billiard map or collision map. Benedicks & Young [H] 
showed that / is nonuniformly hyperbolic in the sense of Young with R & U for all p > 2. 
By Theorem 11.61 the vector- valued ASIP holds for / with p = oo. The collision time 
is uniformly bounded and piecewise Holder, so it follows from that the vector-valued 
ASIP holds for the Lorentz flow on M. Now take as an M^-valued observable the velocity 
coordinate v : M ^ S^. This is piecewise Holder, and the lifted position in is given by 
g(T) = /q o ftdt. Finally, nonsingularity of the covariance matrix was proved in ■ 

Chernov & Zhang ^2] consider three classes of billiards with slow mixing rates. The flrst 
and third classes are not covered by our results since it is shown only that R G L'^~^. The 
second class of Bunimovich-type billiards treated in jT21 satisfles R G The vector- valued 

ASIP for such billiards (and the corresponding flows) is hence a consequence of Theorem II. 61 
with error f5 = 

As in pUj, Theorem 11.61 also applies to Lozi maps and certain piecewise hyperbolic maps, 
Henon-like maps and partially hyperbolic diffeomorphisms with mostly contracting direction. 

A further important class of dynamical systems is singular hyperbolic flows [221 • Theo- 
rem [T21 does not apply directly to such systems, but it establishes the vector- valued ASIP 
(with p = oo) when combined with the techniques in Holland & Melbourne 

4.3 Open problems 

Given a (non) uniformly hyperbolic flow ft, the time-one map /i is only partially hyperbolic. 
For such maps the Gibbs-Markov/suspension formalism breaks down so the results in [20] 
and in this paper do not apply. By different methods, Melbourne & Torok [^ proved that 
the scalar ASIP is typically valid for the time-one map of an Axiom A flow. They used rapid 
mixing properties to reduce to a reverse martingale difference sequence. Following [Til l^^!- 
this leads to the ASIP in reverse time and hence forwards time (since the class of such flows 
is closed under time reversal). Similarly, the scalar ASIP for the time-one map of the planar 
periodic Lorentz gas with flnite horizons is typically valid (since the flow is typically rapid 
mixing [201 class of flows is closed under time reversal). 
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Problem 1 Prove that the vector-valued ASIP holds (at least typically) for time-one maps 
of Axiom A flows and/or planar periodic Lorentz gas with finite horizons. 

Generally speaking, the hypotheses for a nonuniformly hyperbolic system are not time- 
symmetric so ^1 1^ |M] fails. 

Problem 2 Obtain results on the scalar ASIP for time-one maps of nonuniformly hyperbolic 
flows and/or singular hyperbolic flows. 

Remark 4.4 (a) The Banach space-valued ASIP of |3] applies to Problem ^ with the 
caveats mentioned in Remark ll.iur a). In particular, the d- dimensional functional LIL is 
typically valid. These results do not apply to Problem |21 

(b) The (vector-valued) functional central limit theorem is typically valid in Problems ^ 
and 121 (combining the arguments in Section 3.3] and pTTj). 

A Scalar ASIP with error term 

In this appendix, we prove a scalar ASIP using martingale approximation and the method 
of Strassen 41J. This is precisely the result |2H1 Theorem 7.1] used in [201, but our purpose 
here is to obtain a better error term under assumptions appropriate for dynamical systems. 
This improves Theorem 12. II when d = 1. 

We assume the conditions of Section |21 except that (|2.5|) is replaced by 

E{En<NVnr = N + 0{N'/'). (A.l) 

Define {yj} as in Section IT^ In particular. Theorem I2.1UI is unchanged. 

Law of large numbers for y'j 

Lemma A.l Let 7 = max{i^, ig}. Then Y.j<M^ Ey] = N + 0{N^+"'). 

Proof (cf. [38, Lemma 7.3.5]) By (jSIl, ^(E„<7v^n)^ = (^n where a% = N{1 + 0{N-^/^)). 
Write 

Pm Pm 
n<N n<PM n=N+l u-CPm j<M j<M n=N+l 

Then 

— ||Ej<M ^i||2 ||En<PM(^" ~ ^^"-^IL ~'~ l|En=Ar+l ^n||2- 
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By Proposition ITE || Z]n<iv('7n ~ Vin)h < 1- By the proof of LemmaEIEl II ^j<M ^jWl < 
Mi+2° and so || E^xAZ-^ilb < A^(|+")/(i+Q). By stationarity and ^Kl\f . \\ J2n=N+iVn\\l = 
IIE„<P,,-^^n||^ Pm-N « iVQ/(i+«). Hence ||E,<m%II2 = a^, + 0{N^) and 
E{^j<M Vjf = N + 0(A^5+7). Also, as in the proof of Lemma|2II21 Y^i^j Eytyj < 1. ■ 



Corollary A.2 Let f3 = max{i±^, i±||±i^}. Then E,<m^ Ey] = N + 0{N^^). I 

Lemma A.3 Lei /3 = (| - | + iQ)/(l + Q) for 2 < p < 4 (and P={1 + |g)/(l + Q) for 
p> A). Then for any e> 0, J^jkmn y] ~ -^^1 ^ N'^'^+^ a.e. 

Proof The value of e below may change from line to line. Define 



Eyl |y2_E^2|<^-i+Q+. 



otherwise 



Note that P{wj ^ - Ey]) = P{\y] - Ey]\ > < 2||?/J2/^-i+Q+e ^ ^-(i+e) ^Yiidi is 

summable, so by Borel-Cantelli wj fails to coincide with y^ — Eyj only finitely often. Hence 
it suffices to estimate Ej<M'"^i- ^'^ ^^^^ t)y estimating Ejxa/'^j ^'^'^ J2j<M ^'^j where 
= Wj — Ewj. 

Again Ei^jEwiWj < 1. Also, < \wf\\\w^f''^^\\^ < ll^'ll^lkjll^''^' < j"^'"', 

where /2 = 3- 2 + 2(5 + e. Hence ^(Ei=m*j)^ < - m^. By Lemma ITHl Ejxm *i < 
Mi^ < Ari^?/(i+Q) for any e > 0. 

Let A = {\y] - Ey]\ > Then 

Ew, = -E[{y^ - Ey])lA} « \\y] - Ey%/4U\Up-2) 
where S = - — e + Q. Hence Ejxm E'^j ■ Thus, it suffices that 2/3 = max{|_R/ (1 + 

Q),5/(i + g)} = |i?/(i + Q). " I 



Martingale approximation 

Lemma A. 4 Set Cj = C\ = (r{yi, . . . ,yj} . There is a martingale difference sequence 
{Yj, Cj} such that yj = Yj + Uj — Uj^i, where \\uj\\q <^ f-'" for all 2 < q < p. 
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Proof (cf. [HHl Lemma 7.4.1]) Define Uj = XlfcLo A-i)- estimate 
\\E{yj^k\Cj_i)\\g which we write for convenience as Note that 

E\E{y\C)\^ = E{E{y\C)Eiy\C)\Eiy\C)r'] = E{E{yEiy\C)\Eiy\C)r'\C}] 
= E{yEiy\C)E{y\Cr-'}. 

Write i + i = 1. Then ^ + i < 1, so by Lemma 

q s p s ' ' ' 

E\E{y\C)\'^ < WyUEiylCy-Xf'^^^'r _ 

Note that \\E{y\CY-^\\s = {E\E{y\C)\'^)^'^ , and so dividing both sides by this yields 
\\E{y\C)\\g < ||2/||pr(J+^)". Since E^o^^^^''^" < J^^" ^ the estimate for \\uj\\q follows (in- 
creasing f slightly). 

At the same time, it follows immediately that YlT=o < °° which guarantees 

(see eg. ^Hl Lemma 2.1]) that Yj is a martingale difference sequence. I 

Corollary A.5 EjkmM " ^i) ^ ^ ^J■<A/^.(l/| - ^/) < 

Proof We have Y^j^MiVj - Yj) = Ui - Um+i, and hence certainly | Y^j^MiVj - ^ 
J2j>i I'^jl- Lemma rA.4l J2j>i I'^ili < oo so that X]j>i I'^jl < ^ proving the first 
statement. 

Set vj = u, - u,+i. Then F/ - y] = v] - 2y,v^. Now ^E,"i^| = E," i < 
^oo^ ~2r < OO by Lemma E31 Hence T^jkm^j < EJi^i < a.e. Finally, T.jKAiVj'^j < 
(E,<M ^ (E,<M ^ « (E,<M ^ « "^^ by Theorem EIDl " ■ 

Lemma A. 6 Let Xk be a martingale difference sequence and m G (1,2]. Suppose that 
bi<b2< ^ oo. IfT.kKnK'^ElXkl"' < oo, then t2k<n^k = o{bn) a.e. 

Proof For m = 2, this is explicit in |20, p. 238]. For m G (1, 2) it follows from a standard 
martingale result. Chow ^H]; combined with Kronecker's lemma; this is implicit in the proof 
of IMl Lemma 7.4.4]. ■ 

Lemma A.7 Let /? = (^ + \Q)/{1 + Q) for 2 < p < A (and (3 = {\ + \Q)/{1 + Q) for 
p>A). Then for any e > 0, J2j<Mr,iE{Y^^\Cj^i) - F/) < N'^'^+' a.e. 

Proof Define Rj = E{Y^^\Cj_i) — Yj . Suppose that > 2/p + Q and choose q < p so that 
7 > 2/g + Q. Then 

(_^-7)-g/2^|^.|<?/2 ^ j-yi/^E\Yj\i < j-i-r-QM^^ 
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hence J2f^^{j'f)-''/'^E\Rj\'i/^ < oo. Note that | G (1,2] and Rj is a martingale difference 
sequence, so it follows from Lemma IA.6I that X]j<M -^i ^^"^ ^'^^ result follows from 

Proposition 12.71 ■ 

We now apply Strassen's martingale version of the Skorokhod embedding [¥1J. There 
exist non- negative random variables Tj such that the sequences {J2j<M^j^ ^ > l} and 
{^(X]j<M -^j)' ^ ^ 1} are equal in distribution. 

Proposition A. 8 For (3 as in Corollary \A.S\ Lemma \A.S\ Lemma lA.Tl and any e > 0, 

E,<M.T, - N <^ N'^^^ a.e. 

Proof Let Am = cr{W(t) : <t < X]j<A/^i}' '^m C Am- Each Tj is ^j-measurable, 
E{Tj\Aj-i) = E{Y^\Cj_i) a.e., and ETj <^ E\Yj\'^P. In particular, the argument in 
Lemma fA. 71 implies that 

E,<m{Tj - E{TMj-i)) « a.e. (A.2) 

Now write 

The result follows from ()A.2|1 . Corollaries IA.2I and IA.5| and Lemmas IA.3I and I A. 71 I 

Theorem A. 9 Let P = ^ + \ for 2 < p < 4 and /3 = | for p > 4. For any e > 0, 

En<NVn = W{N) + 0{NPh a.e. 

Proof By Theorem 12. 101 and Corollarv IA.51 it suffices to prove that YljKM^j ~ W{N) + 
0{N'^+^'). Equivalents, W^(Ei<M^i) = ^(^) + 0{N'^+^'). Hence the resuh follows from 
Proposition IA.81 I 
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